A speech enhancement algorithm based on a non-negative hidden Markov model and Kullback-Leibler divergence
نویسندگان
چکیده
Abstract In this paper, we propose a supervised single-channel speech enhancement method that combines Kullback-Leibler (KL) divergence-based non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM). With the integration of HMM, temporal dynamics information signals can be taken into account. This includes training stage an stage. stage, sum Poisson distribution, leading to KL divergence measure, is used as observation for each state HMM. ensures computationally efficient multiplicative update parameter model. online novel minimum mean square error estimator proposed NMF-HMM. implemented using parallel computing, reducing time complexity. Moreover, compared traditional NMF-based methods, experimental results show our algorithm improved short-time objective intelligibility perceptual evaluation quality by 5% 0.18, respectively.
منابع مشابه
Dysarthric Speech Recognition Using Kullback-Leibler Divergence-Based Hidden Markov Model
Dysarthria is a neuro-motor speech disorder that impedes the physical production of speech. Patients with dysarthria often have trouble in pronouncing certain sounds, resulting in undesirable phonetic variation. Current automatic speech recognition systems designed for the general public are ineffective for dysarthric sufferers due to the phonetic variation. In this paper, we investigate dysart...
متن کاملModel Confidence Set Based on Kullback-Leibler Divergence Distance
Consider the problem of estimating true density, h(.) based upon a random sample X1,…, Xn. In general, h(.)is approximated using an appropriate in some sense, see below) model fƟ(x). This article using Vuong's (1989) test along with a collection of k(> 2) non-nested models constructs a set of appropriate models, say model confidence set, for unknown model h(.).Application of such confide...
متن کاملSparse Non-negative Matrix Factorization with Generalized Kullback-Leibler Divergence
Non-negative Matrix Factorization (NMF), especially with sparseness constraints, plays a critically important role in data engineering and machine learning. Hoyer (2004) presented an algorithm to compute NMF with exact sparseness constraints. The exact sparseness constraints depends on a projection operator. In the present work, we first give a very simple counterexample, for which the projecti...
متن کاملMarkov-switching model selection using Kullback–Leibler divergence
In Markov-switching regression models, we use Kullback–Leibler (KL) divergence between the true and candidate models to select the number of states and variables simultaneously. Specifically, we derive a new information criterion, Markov switching criterion (MSC), which is an estimate of KL divergence. MSC imposes an appropriate penalty to mitigate the overretention of states in the Markov chai...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Eurasip Journal on Audio, Speech, and Music Processing
سال: 2022
ISSN: ['1687-4722', '1687-4714']
DOI: https://doi.org/10.1186/s13636-022-00256-5